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Abstract 

In this paper, we investigate the evolution of the network entropy for consensus dynamics in classical 
or quantum networks. We show that in the classical case, the network entropy decreases at the con¬ 
sensus limit if the node initial values are i.i.d. Bernoulli random variables, and the network differential 
entropy is monotonically non-increasing if the node initial values are i.i.d. Gaussian. While for quan¬ 
tum consensus dynamics, the network’s von Neumann entropy is in contrast non-decreasing. In light of 
this inconsistency, we compare several gossiping algorithms with random or deterministic coefficients 
for classical or quantum networks, and show that quantum gossiping algorithms with deterministic 
coefficients are physically related to classical gossiping algorithms with random coefficients. 
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1 Introduction 

With the basic idea being able to be traced back to [2], problems of distributed consensus seeking have been 
widely studied in the past decade sparked by the work of 13 Sj. The states of a group of interconnected 
nodes can asymptotically reach the average value of their initial states via neighboring node interactions 
and simple distributed control rule 13 El S|, which forms a foundational block for the further development 
in the broad range of control of network systems |5] . The understanding of distributed consensus algorithms 
has been substantially advanced in aspects ranging from convergence speed optimisation and directed links 
to switching interactions and nonlinear dynamics [3 13 Eli- 

On the other hand, recent work CI1II2 brought the idea of distributed averaging consensus to quantum 
networks, where each node corresponds to a qubit, i.e., a quantum bit m- In quantum mechanics, the 
state of a qubit is represented by a density matrix over a two-dimensional Hilbert space H, and the state 
of a quantum network with N qubits corresponds to a density matrix over the iV’th tensor product of 
H. The concepts regarding the network density matrix reaching a quantum consensus were systematically 
developed in El and it has been shown that a quantum consensus can be reached with the help of 
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quantum swapping operators for both continuous-time and discrete-time dynamics mmm- In fact, 
the two categories of dynamics over classical and quantum networks can be put together into a group- 
theoretic framework |12) . and quantum consensus dynamics can even be equivalently mapped into some 
parallel classical dynamics over disjoint subsets of the entries of the network density matrix [T3], 

In this paper, we make an attempt to look at the relation between the two categories of dynamics from 
a physical perspective, despite their various consistencies already shown in P2J [Hj. The density matrix 
describes a quantum system in a mixed state that is a statistical ensemble of several quantum states, 
analogous to the probability distribution function of a random variable m- First of all we investigate the 
evolution of the network entropy for consensus dynamics in classical or quantum networks. We show that in 
classical consensus dynamics, the network entropy decreases at the consensus limit if the node initial values 
are i.i.d. Bernoulli random variables, and the network differential entropy is monotonically non-increasing 
if the node initial values are i.i.d. Gaussian. While for the quantum consensus dynamics, the network’s 
von Neumann entropy is in contrast non-decreasing. These observations suggest that the two types of 
consensus schemes may have different physical footings. Then, we compare several gossiping algorithms 
with random or deterministic coefficients for classical or quantum networks and present novel convergence 
conditions for gossiping algorithms with random coefficients. The result shows that quantum gossiping 
algorithms with deterministic coefficients are physically consistent with classical gossiping algorithms with 
random coefficients. 

The remainder of the paper is organized as follows. Section 2 presents the problem of interest as well as 
the main results. Section 3 presents the proofs of the statements. Finally Section 4 concludes the paper. 

2 Entropy Evolution and Classical/Quantum Gossiping 

For a network with N nodes in the set V = {1,..., N} with an interconnection structure given by the 
undirected graph G = (V,E), the standard distributed consensus control scheme is described by the 
dynamics 

jX(t) = - L G X(t ), (1) 

where X(t ) = (Xi(t).. . Wv(f)) T with Xi(t) £ IR representing the state of node i £ V, and Lq is the 
Laplacian of the graph G. Here, we refer to [5] for a detailed introduction as well as for the definition of 
the graph Laplacian. 

Also consider a quantum network with N qubits indexed in the set V = {1,..., IV}. We can introduce 
a quantum interaction graph G = (V, E), where {i,j} £ E specifies a swapping operator between the 
two qubits. The state of each qubit is represented by a density matrix over the two-dimensional Hilbert 
space Ti, and the network state corresponds to a density matrix over 'H® N , the IV’th tensor product of H. 
Continuous-time quantum consensus control can be defined by mm 

J t P( t )= { u jkP(t)Uj k - p(i)), (2) 

(j.fcje e 


2 


where p(t) is the network density matrix, and Ujk is the swapping operator between the qubits j and k 
(see m 11 j f° r details on the definition and realization of the swapping operators). 

2.1 Entropy Evolution 

Let the graph G be connected for either the classical or the quantum dynamics. It has been shown that 
for the system (JT]) , there holds (e.g., m 

X(oo) : = lim X(t) = 11 T A(0 )/N, 

t—> OO 

where 1 is the iV x 1 all-ones vector. For the system Q, there holds mmm 

p(oo) := Hm p(t) = ^ U v p(0)Ul, 

where ip is the permutation group and U n represents the quantum permutation operator induced by 
vr e ip. The conceptual consistency of the systems ([!]) and ([2]), as well as the logical consistency of the 
two consensus limits, have been discussed in [HUH!- 

The Shannon entropy is a fundamental measure of uncertainty of a random variable |il|. The entropy 
H(Z), of a discrete random variable Z with alphabet Z is defined as 

H ( z ) : = - l °ZP( z )- 

Here log is to the base 2 and p(-) is the probability mass function. The differential entropy h(Z) of a 
continuous random variable Z with density f(z) is defined as 

K z ) : = - J f(z) log f(z)dz, 

where S is the support of Z. As a natural generalization of the Shannon entropy, for a quantum-mechanical 
system described by a density matrix p, the von Neumann entropy is defined as |10] 

S(p) = — tr(plogp), 

where tr(-) is the trace operator. 

We present the following result for classical consensus dynamics. 

Theorem 1 (i) Let Xj(0) be independent and identically distributed (i.i.d.) Bernoulli random variables 
with mean p € (0,1). Then NX( oo) obeys binomial distribution. Therefore, for the system there holds 
H(X( 0)) = A^fplogp -1 + (1 — p) log(l — p) _1 ], and H(X( oo)) ~ | log (2neNp(l - p )) + O(^). 

(ii) Let Xj(0) be i.i.d. Gaussian random variables with mean p and variance a 2 . Then h(X(t)) is a 
non-increasing function over [0, oo) for the system 0. 

Here H ( X(t)) and h(X(t)) are defined for the random vector X(t). For the Gaussian case, h(X (oo)) does 
not yield a finite number since A"(oo) = 11 1 X(0)/N becomes degenerate. We can however conveniently 
use h(X( oo)) = h(Xi( oo)) and a simple calculation gives 

M*(0)) = y [log(27rea 2 )], h(X( oo)) = \ 
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Figure 1: The quantum interaction graph G with 4 (classical or quantum) nodes. 


For quantum consensus dynamics, the following result holds. 

Theorem 2 For the system |I|), S(p(t)) is a non-decreasing function over [0, oo). 

The above results reveal that, the network entropy in general decreases with classical consensus dy¬ 
namics, but increases with quantum consensus dynamics. This appears to be surprising noticing their 
consistencies pointed out in mum. However, although the systems ([Tj) and Q can be formally united 
(cf., |T4|). X(t) represents a random variable in the classical world, while p(t) is a probability mass function 
by its definition. 

2.2 Numerical Examples 

We now provide a simple example illustrating the derived results. Consider a graph with 4 (classical or 
quantum) nodes as shown in Figure [lj 

For the classical case, we take the X,;(0) as an i.i.d. standard Gaussian random variable. For the quantum 
case, we take the initial density matrix as 

p ( o ) = |01 + -)(01 + -| 

under the Dirac notion m ■ The evolution of the differential entropy and the von Neumann entropy with 
the classical and quantum consensus dynamics is plotted, respectively, in Figure [2] 

2.3 Gossiping with Random/Deterministic Coefficients 

In this subsection, we provide a physical perspective to explain the observations in Theorem [l] and Theo¬ 
rem [2] by investigating a serial of classical or quantum gossiping algorithms with random or deterministic 
coefficients. 

A random gossiping process is defined as follows. Consider N nodes in the set V with an underlying 
interaction graph G which is undirected and connected. Time is sequenced by k = 0,1,.... At time k, a 
node i is first drawn with probability 1 /N, and then node i selects another node j who shares a link with 
node i in the graph G with probability l/deg(z). Here deg(i) is the degree of node i in the graph V. In 
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Figure 2: The evolution of the network entropy for classical (left) and quantum (right) dynamics, respec¬ 
tively. 


this way, a random pair {*, j} is selected. Additionally, let b k , k = 0,1,... be a sequence of i.i.d. Bernoulli 
random variables with mean 1/2, which are also independent of any other possible randomness. 

• In the classical case, each node i holds a real-valued state Xi(k ) £ II at time k. Their initial 
states, Ab(0),..., Wv(0), are assumed to be N (not necessarily independent) random variables over 
a common underlying probability space. The marginal probability (mass or density) distribution of 
node Xi(k) is denoted as p\{-)- When the pair {i,j} is selected at time k, only the two selected 
nodes update their values and we consider the following algorithms. 


[Al] (Classical Gossiping with Deterministic Coefficients, m Node i and j update their values as 

Xi (k + 1) = Xj (k + 1) = ±Xi (k) + ^Xj (AO. (3) 

[A2] (Classical Gossiping with Random Swapping, JT5f ) Node i and j update their values as 


Xi(k + 1) = b k Xi(k ) + (1 - b k )Xj(k)] 
xffik + !) = (! — b k )Xi(k) + bkXffik). 


(4) 


• In the quantum case, each node i represents a qubit and p(k) is the network density matrix at 
time k. When the qubit pair {i,j} is selected at time k, we correspondingly consider the following 
algorithms. 


[AQ1] (Quantum Gossiping with Deterministic Coefficients, //!}/) The quantum network updates its 
density matrix as 

P(k + 1) = \p(k) + ^Uijp^ulj. (5) 

[AQ2] (Quantum Gossiping with Random Swapping, '[lffi) Node i and j update their values as 

p(k + 1) = b k p(k) + (1 - bk)Uijp(k)uJj. (6) 
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We state a few immediate facts for the algorithms [Al], [A2], [AQ1], and [AQ2], 

(i) The evolution of E{X(fc)} is exactly the same along with the algorithms [Al] and [A2]. Similar 
conclusion holds also for the algorithms [AQ1] and [AQ2], 

(ii) Algorithms [Al] and [AQ1] are algorithmically equivalent , in the sense that [AQ1] can be divided 
into a set of parallel algorithms in the form of [Al] over disjoint entries of p(t) (see [14J for a 
thorough treatment via vectorizing p(t)). Similarly, the algorithms [A2] and [AQ2] are algorithmically 
equivalent. 

(iii) Algorithms [A2] and [AQ1] are physically equivalent, in the sense that for a sequence of underlying 
random variables X(k) evolving along [A2], their joint probability mass/density function, denoted 
fk(x i,... ,xjv) (which is exactly the physical interpretation of the density matrix p(k)) will evolve 
in the form of [AQ1] (cf., [12]): 

fk+l{x\i ■ ■ ■ i Xn) = yfk^Xl, • . ■ , Xi, . . . , Xj, . . . , Xjv) T ~fk(x i, . . . , Xj, . . . , Xi, .. . , Xn) (7) 

if the pair {i,j} is selected at time k. 

Recall that a Markov chain is ergodic if it is both aperiodic and irreducible [18]. We present the following 
result establishing the limiting behaviors of the algorithm [A2], which is consistent with the observations 
of the entropy evolution in Theorems [I] and [2] as well as the point (iii) above. 

Theorem 3 For the algorithm [A2], there holds that 

(i) {X(1’)}^1 0 forms an ergodic Markov chain given X(0); 

(ii) lim^oo p[,(-) = J2iLi Po(')/N, where the convergence is exponentially fast under the distance induced 
by l 1 (for X (0) given by discrete random variables) or £} (for continuous X(0)) norms. 

Remark 1 One can also consider the case in a gossiping process when two selected node i and j update 
their values by (Classical Gossiping with Random Coefficients) 

[AT] X l (k + l) = Xj{k + l) = b k X i {k) + (l-b k )X j (k). (8) 

From the second Borel-Cantelli Lemma (e.g., Theorem 2.3.6. in m), that almost surely, Xfik) reaches 
a common value for all i £ V in finite time along the algorithm [Al ’]. Interestingly, it is easy to see that 
the evolution of the pl(-) is the same along the algorithms [Al’] and [A2]. 

Remark 2 The scheme of the algorithms [A2] was briefly discussed in Section 6.2 of / 12], which is also 
a form of gossiping algorithms with unreliable but perfectly dependent link communications studied in m 
with mixing coefficient one. Here Theorem advances the previous understandings by showing that the 
algorithm [A2] defines an ergodic Markov chain for any given initial condition as well as presenting the 
detailed convergence properties of the marginal distribution functions for both discrete and continuous 
X(0). 
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Remark 3 We assume that the mean of the bk is 1/2 just for the ease of presentation. It is clear from 
the proof that Theorem [3| holds for arbitrary E{6fc} 6 (0,1). The ergodicity plays an essential role in 
the convergence of the marginal distributions: the case with E{6fc} = 0 fails because X(k) is no longer 
aperiodic; the case with E {bk} = 1 fails because X(k ) is no longer irreducible. 


3 Proofs of Statements 

This section provides the proofs of Theorems 01! and [3} 


3.1 Proof of Theorem [Q 

(i) The fact that H(X( 0)) = -/Vjplogp^ 1 + (1 — p)log(l — p) -1 ] follows straightforwardly from the in¬ 
dependence of the Xj(0). On the other hand, Xjfoo) = Xj(0)/N follows a Binomial distribution 

B(N,p ) whose entropy is well-known to be ^ log (27refVp(l — p)') + O(j^). Since A/(oo) = Xj(oo) for all 
i,j G V, there holds H(X( oo)) = H(Xi( oo)). This proves (i). 

(ii) The solution X ( t ) of the system @ is 

X(t) = e~ tLa X(0). (9) 


As a result, for any t > 0, X(t) is a Gaussian random vector. Then 


1 


h(X(t)) = -log (27re) A E [[X(t) - E(X(t))][X(t) - E (X(t))] 


= - log 
2 & 


(2nea 2 ) N e~ 2tLa 


where | • | represents the matrix determinant. 


We take e > 0 and compare h(X(t + e)) with h(X(t)). There holds from (10) that 


h(X(t + e)) = h(X(t)) + ^ log |e 2eLc \ 


( 10 ) 


( 11 ) 


Since Lq is the Laplacian of a connected undirected graph G, Lq has a unique zero eigenvalue, and all 
non-zero eigenvalues of Lq are positive (cf., [5j). Consequently, all eigenvalues of e~ 2cLct are positive and 
no larger than one, which yields that 

|e” 2eiG | < 1. 

This proves h{X(t + e)) < h(X(t)). Since e is chosen arbitrarily, we conclude that h(X(t)) is a non¬ 
increasing function. The calculations of h(X(0)) and h(X(oo)) are straightforward. 

We have now completed the proof of Theorem [l] □ 


3.2 Proof of Theorem [2] 

The proof relies on the following lemma. 
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m,r(e) = 1 


Lemma 1 Let e > 0 and fix s > 0. For the system 0), there exist m 7r (e) > 0, ir E ^ with ^jre^ 
such that 


p(s + e) = Y m^{e)U n p(s)Ul. 

TrGg! 


( 12 ) 


Proof. Define a set 

= co(u n p{s)Ul :vrG^), 

where co(-) stands for the convex hull. It is straightforward to see that UjkpU^ k E E s if p E T, s . As a result, 
S s is an invariant set of the system <© in the sense that p(t) E for all t > 0 as long as p( 0) E The 
desired lemma thus follows immediately. □ 

Recalling that the von Neumann entropy S(p) is a concave function of p, and that S(p) = S(UpU') for 
any unitary operator U, we conclude from Lemma [l] that 


S(p{s + e)) = S^Y m % (e)U n p(s)U^ 

TrGg! 

> Y m n (e)s(u n p(s)U^j 

= S(p(s)) (13) 


for any e > 0 and s > 0 in light of the fact that U n is unitary for all ir E ^3. This proves that S(p(t)) is a 
non-decreasing function and Theorem [2] holds. □ 


3.3 Proof of Theorem [3] 


(i). First of all it is clear that {X(fe)}^ c ( 0 is Markovian from its definition. Recall that is the IV’th 
permutation group. We denote the permutation matrix associated with ir E as M n . In particular, the 
permutation matrix associated with the swapping between i and j is denoted as M nij . The state transition 
of {A'(/c)}^l 0 along the algorithm A2 can be written as 


M„..X(k)\x(kj) = (1/deg(i) + 1/deg (j))/N, {i,j} E E. 


(14) 


Since the graph G is connected, the swapping permutations defined along the edges of G form a generating 
set of the permutation group Consequently, given X(0), the set 

{m*X( 0), 7T E q3} 

is the state space of X(k), which contains at most A^! elements. Finally it is straightforward to verify that 
for any given X(0), X(k) is irreducible and aperiodic, and therefore forms an ergodic Markov chain. 

(ii). The statement is in fact a direct consequence from the ergodicity of X(k). We however need to be 
a bit more careful since we assume that X(0) takes value from an arbitrary (not necessarily discrete) 
probability space and the A*(0) are not necessarily independent. We denote the state transition matrix 



for X(k) as P G H NxN . We calculate p\(-) from basic probability equality P(A) = P(-A|Q) under 

YliLi P(Q) = 1 and P (Ci P| Cj) = 0, and then immediately obtain 

N 

Pfc(0 ='52 e J pke iPo(-), ( 15 ) 

s=l 

where e* is the unit vector with the z’th entry being one. It is clear that the above calculation does not rely 
on X(0) being discrete or continuous, and p\{-) represents probability mass or density function wherever 
appropriate. From the definition of the algorithm A2 P is a symmetric matrix and the ergodicity of X{k ) 
leads to 


lim P k = 11 t /N (16) 

k—} OO 

at an exponential rate. The desired conclusion thus follows. 

4 Conclusions 

We have investigated the evolution of the network entropy for consensus dynamics in classical or quantum 
networks. In the classical case, the network entropy decreases at the consensus limit if the node initial 
values are i.i.d. Bernoulli random variables, and the network differential entropy is monotonically non¬ 
increasing if the node initial values are i.i.d. Gaussian. For quantum consensus dynamics, the network’s von 
Neumann entropy is on the contrary non-decreasing. This observation can be easily generalized to balanced 
directed graphs mttaES]. In light of this inconsistency, we also compared several gossiping algorithms with 
random or deterministic coefficients for classical or quantum networks, and showed that quantum gossiping 
algorithms with deterministic coefficients are physically consistent with classical gossiping algorithms with 
random coefficients. 
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